Overview

Dataset statistics

Number of variables: 14
Number of observations: 506
Missing cells: 120
Missing cells (%): 1.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 55.5 KiB
Average record size in memory: 112.3 B

Variable types

NUM: 13
BOOL: 1

Warnings

TAX is highly correlated with RAD (High correlation)
RAD is highly correlated with TAX (High correlation)
CRIM has 20 (4.0%) missing values (Missing)
ZN has 20 (4.0%) missing values (Missing)
INDUS has 20 (4.0%) missing values (Missing)
CHAS has 20 (4.0%) missing values (Missing)
AGE has 20 (4.0%) missing values (Missing)
LSTAT has 20 (4.0%) missing values (Missing)
ZN has 360 (71.1%) zeros (Zeros)
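
The percentages quoted in the overview and warnings can be reproduced from the raw counts. The short sketch below assumes only the figures printed in this report (506 observations, 14 variables, 20 missing values in each of six columns, 360 zeros in ZN); the helper name `percent` is illustrative and is not part of pandas-profiling.

```python
# Counts copied from the "Dataset statistics" and "Warnings" sections of this report.
N_OBSERVATIONS = 506
N_VARIABLES = 14


def percent(count, total):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


# Each of CRIM, ZN, INDUS, CHAS, AGE and LSTAT has 20 missing values.
missing_per_column = percent(20, N_OBSERVATIONS)
# ZN contains 360 zeros.
zeros_in_zn = percent(360, N_OBSERVATIONS)
# 6 columns x 20 missing values = 120 missing cells out of 506 x 14 cells.
missing_cells = percent(6 * 20, N_OBSERVATIONS * N_VARIABLES)

print(missing_per_column, zeros_in_zn, missing_cells)
```

Per-column percentages divide by the number of observations, while the dataset-level "Missing cells (%)" divides by the total number of cells.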

Reproduction

Analysis started: 2022-05-16 12:20:54.342271
Analysis finished: 2022-05-16 12:21:40.872821
Duration: 46.53 seconds
Software version: pandas-profiling v2.9.0
Download configuration: config.yaml

Variables

CRIM
Real number (ℝ≥0)

MISSING

Distinct: 484
Distinct (%): 99.6%
Missing: 20
Missing (%): 4.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.611873971
Minimum: 0.00632
Maximum: 88.9762
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0.00632
5-th percentile: 0.02739
Q1: 0.0819
Median: 0.253715
Q3: 3.5602625
95-th percentile: 15.870875
Maximum: 88.9762
Range: 88.96988
Interquartile range (IQR): 3.4783625

Descriptive statistics

Standard deviation: 8.72019185
Coefficient of variation (CV): 2.414312326
Kurtosis: 36.56834838
Mean: 3.611873971
Median Absolute Deviation (MAD): 0.218875
Skewness: 5.21284265
Sum: 1755.37075
Variance: 76.0417459
Monotonicity: Not monotonic

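
Several of the CRIM statistics are derived from others in the same summary. As a rough consistency check, the sketch below recomputes the coefficient of variation, variance, range and interquartile range from the mean, standard deviation, extremes and quartiles reported for CRIM. It assumes the usual textbook definitions and is not taken from the pandas-profiling source.

```python
import math

# Values copied from the CRIM summary in this report.
mean = 3.611873971
std = 8.72019185
minimum, maximum = 0.00632, 88.9762
q1, q3 = 0.0819, 3.5602625

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range

# Compare against the figures printed in the report (small rounding differences allowed).
print(math.isclose(cv, 2.414312326, rel_tol=1e-6))
print(math.isclose(variance, 76.0417459, rel_tol=1e-6))
print(math.isclose(value_range, 88.96988, rel_tol=1e-6))
print(math.isclose(iqr, 3.4783625, rel_tol=1e-6))
```

The same relationships hold for the other numeric variables in this report.
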
Histogram with fixed size bins (bins=50)
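
Common values

Value | Count | Frequency (%)
0.01501 | 2 | 0.4%
14.3337 | 2 | 0.4%
0.04544 | 1 | 0.2%
0.02498 | 1 | 0.2%
0.01301 | 1 | 0.2%
0.06151 | 1 | 0.2%
0.05497 | 1 | 0.2%
0.03306 | 1 | 0.2%
0.03041 | 1 | 0.2%
0.03427 | 1 | 0.2%
Other values (474) | 474 | 93.7%
(Missing) | 20 | 4.0%

Minimum 5 values

Value | Count | Frequency (%)
0.00632 | 1 | 0.2%
0.00906 | 1 | 0.2%
0.01096 | 1 | 0.2%
0.01301 | 1 | 0.2%
0.01311 | 1 | 0.2%

Maximum 5 values

Value | Count | Frequency (%)
88.9762 | 1 | 0.2%
73.5341 | 1 | 0.2%
67.9208 | 1 | 0.2%
51.1358 | 1 | 0.2%
45.7461 | 1 | 0.2%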

ZN
Real number (ℝ≥0)

MISSING
ZEROS

Distinct: 26
Distinct (%): 5.3%
Missing: 20
Missing (%): 4.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.21193416
Minimum: 0
Maximum: 100
Zeros: 360
Zeros (%): 71.1%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 12.5
95-th percentile: 80
Maximum: 100
Range: 100
Interquartile range (IQR): 12.5

Descriptive statistics

Standard deviation: 23.38887615
Coefficient of variation (CV): 2.086069702
Kurtosis: 4.132614189
Mean: 11.21193416
Median Absolute Deviation (MAD): 0
Skewness: 2.256612605
Sum: 5449
Variance: 547.0395274
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=26)
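
Common values

Value | Count | Frequency (%)
0 | 360 | 71.1%
20 | 20 | 4.0%
80 | 14 | 2.8%
22 | 10 | 2.0%
25 | 10 | 2.0%
12.5 | 10 | 2.0%
40 | 6 | 1.2%
45 | 6 | 1.2%
90 | 5 | 1.0%
30 | 5 | 1.0%
Other values (16) | 40 | 7.9%
(Missing) | 20 | 4.0%

Minimum 5 values

Value | Count | Frequency (%)
0 | 360 | 71.1%
12.5 | 10 | 2.0%
17.5 | 1 | 0.2%
18 | 1 | 0.2%
20 | 20 | 4.0%

Maximum 5 values

Value | Count | Frequency (%)
100 | 1 | 0.2%
95 | 4 | 0.8%
90 | 5 | 1.0%
85 | 2 | 0.4%
82.5 | 2 | 0.4%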

INDUS
Real number (ℝ≥0)

MISSING

Distinct: 76
Distinct (%): 15.6%
Missing: 20
Missing (%): 4.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.08399177
Minimum: 0.46
Maximum: 27.74
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0.46
5-th percentile: 2.18
Q1: 5.19
Median: 9.69
Q3: 18.1
95-th percentile: 21.3125
Maximum: 27.74
Range: 27.28
Interquartile range (IQR): 12.91

Descriptive statistics

Standard deviation: 6.835896499
Coefficient of variation (CV): 0.6167359775
Kurtosis: -1.217990915
Mean: 11.08399177
Median Absolute Deviation (MAD): 6.32
Skewness: 0.3037221876
Sum: 5386.82
Variance: 46.72948094
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
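
Common values

Value | Count | Frequency (%)
18.1 | 127 | 25.1%
19.58 | 28 | 5.5%
8.14 | 22 | 4.3%
6.2 | 18 | 3.6%
21.89 | 14 | 2.8%
9.9 | 12 | 2.4%
3.97 | 12 | 2.4%
8.56 | 11 | 2.2%
10.59 | 11 | 2.2%
5.86 | 9 | 1.8%
Other values (66) | 222 | 43.9%
(Missing) | 20 | 4.0%

Minimum 5 values

Value | Count | Frequency (%)
0.46 | 1 | 0.2%
0.74 | 1 | 0.2%
1.21 | 1 | 0.2%
1.22 | 1 | 0.2%
1.25 | 2 | 0.4%

Maximum 5 values

Value | Count | Frequency (%)
27.74 | 5 | 1.0%
25.65 | 6 | 1.2%
21.89 | 14 | 2.8%
19.58 | 28 | 5.5%
18.1 | 127 | 25.1%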

CHAS
Boolean

MISSING

Distinct: 2
Distinct (%): 0.4%
Missing: 20
Missing (%): 4.0%
Memory size: 4.0 KiB

Value | Count | Frequency (%)
0 | 452 | 89.3%
1 | 34 | 6.7%
(Missing) | 20 | 4.0%

NOX
Real number (ℝ≥0)

Distinct: 81
Distinct (%): 16.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.5546950593
Minimum: 0.385
Maximum: 0.871
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0.385
5-th percentile: 0.40925
Q1: 0.449
Median: 0.538
Q3: 0.624
95-th percentile: 0.74
Maximum: 0.871
Range: 0.486
Interquartile range (IQR): 0.175

Descriptive statistics

Standard deviation: 0.1158776757
Coefficient of variation (CV): 0.2089033853
Kurtosis: -0.06466713337
Mean: 0.5546950593
Median Absolute Deviation (MAD): 0.0875
Skewness: 0.7293079225
Sum: 280.6757
Variance: 0.01342763572
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
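
Common values

Value | Count | Frequency (%)
0.538 | 23 | 4.5%
0.713 | 18 | 3.6%
0.437 | 17 | 3.4%
0.871 | 16 | 3.2%
0.624 | 15 | 3.0%
0.489 | 15 | 3.0%
0.693 | 14 | 2.8%
0.605 | 14 | 2.8%
0.74 | 13 | 2.6%
0.544 | 12 | 2.4%
Other values (71) | 349 | 69.0%

Minimum 5 values

Value | Count | Frequency (%)
0.385 | 1 | 0.2%
0.389 | 1 | 0.2%
0.392 | 2 | 0.4%
0.394 | 1 | 0.2%
0.398 | 2 | 0.4%

Maximum 5 values

Value | Count | Frequency (%)
0.871 | 16 | 3.2%
0.77 | 8 | 1.6%
0.74 | 13 | 2.6%
0.718 | 6 | 1.2%
0.713 | 18 | 3.6%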

RM
Real number (ℝ≥0)

Distinct: 446
Distinct (%): 88.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.284634387
Minimum: 3.561
Maximum: 8.78
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 3.561
5-th percentile: 5.314
Q1: 5.8855
Median: 6.2085
Q3: 6.6235
95-th percentile: 7.5875
Maximum: 8.78
Range: 5.219
Interquartile range (IQR): 0.738

Descriptive statistics

Standard deviation: 0.7026171434
Coefficient of variation (CV): 0.1117992074
Kurtosis: 1.891500366
Mean: 6.284634387
Median Absolute Deviation (MAD): 0.3455
Skewness: 0.4036121333
Sum: 3180.025
Variance: 0.4936708502
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
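
Common values

Value | Count | Frequency (%)
5.713 | 3 | 0.6%
6.167 | 3 | 0.6%
6.127 | 3 | 0.6%
6.229 | 3 | 0.6%
6.405 | 3 | 0.6%
6.417 | 3 | 0.6%
6.782 | 2 | 0.4%
6.951 | 2 | 0.4%
6.63 | 2 | 0.4%
6.312 | 2 | 0.4%
Other values (436) | 480 | 94.9%

Minimum 5 values

Value | Count | Frequency (%)
3.561 | 1 | 0.2%
3.863 | 1 | 0.2%
4.138 | 2 | 0.4%
4.368 | 1 | 0.2%
4.519 | 1 | 0.2%

Maximum 5 values

Value | Count | Frequency (%)
8.78 | 1 | 0.2%
8.725 | 1 | 0.2%
8.704 | 1 | 0.2%
8.398 | 1 | 0.2%
8.375 | 1 | 0.2%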

AGE
Real number (ℝ≥0)

MISSING

Distinct: 348
Distinct (%): 71.6%
Missing: 20
Missing (%): 4.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68.51851852
Minimum: 2.9
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 2.9
5-th percentile: 17.95
Q1: 45.175
Median: 76.8
Q3: 93.975
95-th percentile: 100
Maximum: 100
Range: 97.1
Interquartile range (IQR): 48.8

Descriptive statistics

Standard deviation: 27.99951301
Coefficient of variation (CV): 0.4086415412
Kurtosis: -0.9821403245
Mean: 68.51851852
Median Absolute Deviation (MAD): 20.15
Skewness: -0.5824700575
Sum: 33300
Variance: 783.9727285
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
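
Common values

Value | Count | Frequency (%)
100 | 42 | 8.3%
97.9 | 4 | 0.8%
87.9 | 4 | 0.8%
98.8 | 4 | 0.8%
96 | 4 | 0.8%
95.4 | 4 | 0.8%
76.5 | 3 | 0.6%
97 | 3 | 0.6%
96.2 | 3 | 0.6%
32.2 | 3 | 0.6%
Other values (338) | 412 | 81.4%
(Missing) | 20 | 4.0%

Minimum 5 values

Value | Count | Frequency (%)
2.9 | 1 | 0.2%
6.2 | 1 | 0.2%
6.5 | 1 | 0.2%
6.6 | 2 | 0.4%
6.8 | 1 | 0.2%

Maximum 5 values

Value | Count | Frequency (%)
100 | 42 | 8.3%
99.3 | 1 | 0.2%
99.1 | 1 | 0.2%
98.9 | 3 | 0.6%
98.8 | 4 | 0.8%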

DIS
Real number (ℝ≥0)

Distinct: 412
Distinct (%): 81.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.795042688
Minimum: 1.1296
Maximum: 12.1265
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 1.1296
5-th percentile: 1.461975
Q1: 2.100175
Median: 3.20745
Q3: 5.188425
95-th percentile: 7.8278
Maximum: 12.1265
Range: 10.9969
Interquartile range (IQR): 3.08825

Descriptive statistics

Standard deviation: 2.105710127
Coefficient of variation (CV): 0.5548580872
Kurtosis: 0.4879411222
Mean: 3.795042688
Median Absolute Deviation (MAD): 1.29115
Skewness: 1.011780579
Sum: 1920.2916
Variance: 4.434015137
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
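
Common values

Value | Count | Frequency (%)
3.4952 | 5 | 1.0%
5.7209 | 4 | 0.8%
5.2873 | 4 | 0.8%
6.8147 | 4 | 0.8%
5.4007 | 4 | 0.8%
6.3361 | 3 | 0.6%
3.9454 | 3 | 0.6%
6.498 | 3 | 0.6%
4.7211 | 3 | 0.6%
4.8122 | 3 | 0.6%
Other values (402) | 470 | 92.9%

Minimum 5 values

Value | Count | Frequency (%)
1.1296 | 1 | 0.2%
1.137 | 1 | 0.2%
1.1691 | 1 | 0.2%
1.1742 | 1 | 0.2%
1.1781 | 1 | 0.2%

Maximum 5 values

Value | Count | Frequency (%)
12.1265 | 1 | 0.2%
10.7103 | 2 | 0.4%
10.5857 | 2 | 0.4%
9.2229 | 1 | 0.2%
9.2203 | 2 | 0.4%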

RAD
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 9
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.549407115
Minimum: 1
Maximum: 24
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 4
Median: 5
Q3: 24
95-th percentile: 24
Maximum: 24
Range: 23
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 8.707259384
Coefficient of variation (CV): 0.9118115166
Kurtosis: -0.8672319936
Mean: 9.549407115
Median Absolute Deviation (MAD): 2
Skewness: 1.004814648
Sum: 4832
Variance: 75.81636598
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=9)
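
Common values

Value | Count | Frequency (%)
24 | 132 | 26.1%
5 | 115 | 22.7%
4 | 110 | 21.7%
3 | 38 | 7.5%
6 | 26 | 5.1%
2 | 24 | 4.7%
8 | 24 | 4.7%
1 | 20 | 4.0%
7 | 17 | 3.4%

Minimum 5 values

Value | Count | Frequency (%)
1 | 20 | 4.0%
2 | 24 | 4.7%
3 | 38 | 7.5%
4 | 110 | 21.7%
5 | 115 | 22.7%

Maximum 5 values

Value | Count | Frequency (%)
24 | 132 | 26.1%
8 | 24 | 4.7%
7 | 17 | 3.4%
6 | 26 | 5.1%
5 | 115 | 22.7%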

TAX
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 66
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 408.2371542
Minimum: 187
Maximum: 711
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 187
5-th percentile: 222
Q1: 279
Median: 330
Q3: 666
95-th percentile: 666
Maximum: 711
Range: 524
Interquartile range (IQR): 387

Descriptive statistics

Standard deviation: 168.5371161
Coefficient of variation (CV): 0.4128411987
Kurtosis: -1.142407992
Mean: 408.2371542
Median Absolute Deviation (MAD): 73
Skewness: 0.6699559418
Sum: 206568
Variance: 28404.75949
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
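
Common values

Value | Count | Frequency (%)
666 | 132 | 26.1%
307 | 40 | 7.9%
403 | 30 | 5.9%
437 | 15 | 3.0%
304 | 14 | 2.8%
264 | 12 | 2.4%
398 | 12 | 2.4%
384 | 11 | 2.2%
277 | 11 | 2.2%
224 | 10 | 2.0%
Other values (56) | 219 | 43.3%

Minimum 5 values

Value | Count | Frequency (%)
187 | 1 | 0.2%
188 | 7 | 1.4%
193 | 8 | 1.6%
198 | 1 | 0.2%
216 | 5 | 1.0%

Maximum 5 values

Value | Count | Frequency (%)
711 | 5 | 1.0%
666 | 132 | 26.1%
469 | 1 | 0.2%
437 | 15 | 3.0%
432 | 9 | 1.8%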

PTRATIO
Real number (ℝ≥0)

Distinct: 46
Distinct (%): 9.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18.4555336
Minimum: 12.6
Maximum: 22
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 12.6
5-th percentile: 14.7
Q1: 17.4
Median: 19.05
Q3: 20.2
95-th percentile: 21
Maximum: 22
Range: 9.4
Interquartile range (IQR): 2.8

Descriptive statistics

Standard deviation: 2.164945524
Coefficient of variation (CV): 0.1173060379
Kurtosis: -0.2850913833
Mean: 18.4555336
Median Absolute Deviation (MAD): 1.15
Skewness: -0.8023249269
Sum: 9338.5
Variance: 4.686989121
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=46)
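
Common values

Value | Count | Frequency (%)
20.2 | 140 | 27.7%
14.7 | 34 | 6.7%
21 | 27 | 5.3%
17.8 | 23 | 4.5%
19.2 | 19 | 3.8%
17.4 | 18 | 3.6%
18.6 | 17 | 3.4%
19.1 | 17 | 3.4%
18.4 | 16 | 3.2%
16.6 | 16 | 3.2%
Other values (36) | 179 | 35.4%

Minimum 5 values

Value | Count | Frequency (%)
12.6 | 3 | 0.6%
13 | 12 | 2.4%
13.6 | 1 | 0.2%
14.4 | 1 | 0.2%
14.7 | 34 | 6.7%

Maximum 5 values

Value | Count | Frequency (%)
22 | 2 | 0.4%
21.2 | 15 | 3.0%
21.1 | 1 | 0.2%
21 | 27 | 5.3%
20.9 | 11 | 2.2%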

B
Real number (ℝ≥0)

Distinct: 357
Distinct (%): 70.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 356.6740316
Minimum: 0.32
Maximum: 396.9
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0.32
5-th percentile: 84.59
Q1: 375.3775
Median: 391.44
Q3: 396.225
95-th percentile: 396.9
Maximum: 396.9
Range: 396.58
Interquartile range (IQR): 20.8475

Descriptive statistics

Standard deviation: 91.29486438
Coefficient of variation (CV): 0.255961624
Kurtosis: 7.226817549
Mean: 356.6740316
Median Absolute Deviation (MAD): 5.46
Skewness: -2.890373712
Sum: 180477.06
Variance: 8334.752263
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
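
Common values

Value | Count | Frequency (%)
396.9 | 121 | 23.9%
393.74 | 3 | 0.6%
395.24 | 3 | 0.6%
376.14 | 2 | 0.4%
394.72 | 2 | 0.4%
395.63 | 2 | 0.4%
392.8 | 2 | 0.4%
395.56 | 2 | 0.4%
390.94 | 2 | 0.4%
393.68 | 2 | 0.4%
Other values (347) | 365 | 72.1%

Minimum 5 values

Value | Count | Frequency (%)
0.32 | 1 | 0.2%
2.52 | 1 | 0.2%
2.6 | 1 | 0.2%
3.5 | 1 | 0.2%
3.65 | 1 | 0.2%

Maximum 5 values

Value | Count | Frequency (%)
396.9 | 121 | 23.9%
396.42 | 1 | 0.2%
396.33 | 1 | 0.2%
396.3 | 1 | 0.2%
396.28 | 1 | 0.2%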

LSTAT
Real number (ℝ≥0)

MISSING

Distinct: 438
Distinct (%): 90.1%
Missing: 20
Missing (%): 4.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.7154321
Minimum: 1.73
Maximum: 37.97
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 1.73
5-th percentile: 3.7075
Q1: 7.125
Median: 11.43
Q3: 16.955
95-th percentile: 27.15
Maximum: 37.97
Range: 36.24
Interquartile range (IQR): 9.83

Descriptive statistics

Standard deviation: 7.155870816
Coefficient of variation (CV): 0.5627705579
Kurtosis: 0.5186825176
Mean: 12.7154321
Median Absolute Deviation (MAD): 4.795
Skewness: 0.908891837
Sum: 6179.7
Variance: 51.20648713
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
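
Common values

Value | Count | Frequency (%)
7.79 | 3 | 0.6%
6.36 | 3 | 0.6%
8.05 | 3 | 0.6%
14.1 | 3 | 0.6%
18.13 | 3 | 0.6%
30.81 | 2 | 0.4%
4.59 | 2 | 0.4%
7.39 | 2 | 0.4%
12.67 | 2 | 0.4%
5.29 | 2 | 0.4%
Other values (428) | 461 | 91.1%
(Missing) | 20 | 4.0%

Minimum 5 values

Value | Count | Frequency (%)
1.73 | 1 | 0.2%
1.92 | 1 | 0.2%
1.98 | 1 | 0.2%
2.47 | 1 | 0.2%
2.87 | 1 | 0.2%

Maximum 5 values

Value | Count | Frequency (%)
37.97 | 1 | 0.2%
36.98 | 1 | 0.2%
34.77 | 1 | 0.2%
34.41 | 1 | 0.2%
34.37 | 1 | 0.2%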

MEDV
Real number (ℝ≥0)

Distinct: 229
Distinct (%): 45.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.53280632
Minimum: 5
Maximum: 50
Zeros: 0
Zeros (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 5
5-th percentile: 10.2
Q1: 17.025
Median: 21.2
Q3: 25
95-th percentile: 43.4
Maximum: 50
Range: 45
Interquartile range (IQR): 7.975

Descriptive statistics

Standard deviation: 9.197104087
Coefficient of variation (CV): 0.408165053
Kurtosis: 1.495196944
Mean: 22.53280632
Median Absolute Deviation (MAD): 4
Skewness: 1.108098408
Sum: 11401.6
Variance: 84.58672359
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
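
Common values

Value | Count | Frequency (%)
50 | 16 | 3.2%
25 | 8 | 1.6%
22 | 7 | 1.4%
21.7 | 7 | 1.4%
23.1 | 7 | 1.4%
19.4 | 6 | 1.2%
20.6 | 6 | 1.2%
13.8 | 5 | 1.0%
21.4 | 5 | 1.0%
20.1 | 5 | 1.0%
Other values (219) | 434 | 85.8%

Minimum 5 values

Value | Count | Frequency (%)
5 | 2 | 0.4%
5.6 | 1 | 0.2%
6.3 | 1 | 0.2%
7 | 2 | 0.4%
7.2 | 3 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
50 | 16 | 3.2%
48.8 | 1 | 0.2%
48.5 | 1 | 0.2%
48.3 | 1 | 0.2%
46.7 | 1 | 0.2%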

Interactions

[Figures: 169 pairwise interaction plots (13 × 13 numeric variables), rendered with Matplotlib v3.5.0]

Correlations


Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
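
As an illustration of the calculation described above, here is a minimal pure-Python sketch. The function name `pearson_r` and the sample data are made up for this example and are not taken from the report or from pandas-profiling.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares; the common 1/(n-1) factor cancels in the ratio.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when either variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Made-up illustrative data, not taken from the report.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shifting and rescaling either variable (with a positive scale) leaves r unchanged.
r_rescaled = pearson_r([10 * v + 3 for v in x], [0.5 * v - 7 for v in y])
print(r, r_rescaled)
```

The second call demonstrates the location and scale invariance mentioned in the description: both calls return the same value up to floating-point error.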

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
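
The rank-based calculation can be sketched in pure Python as follows. Giving tied values the average of the ranks they span is an assumption of this sketch (a common convention), and all function names and data here are illustrative rather than taken from pandas-profiling.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared_rank
        start = end + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations (scale factors cancel)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Made-up data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(pearson_r(x, y), spearman_rho(x, y))
```

For this monotonic but nonlinear relationship, ρ reaches 1 while Pearson's r stays below 1, which is the behaviour the description refers to.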

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
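
The pair-counting definition above can be sketched directly in Python. This version corresponds to the variant usually called tau-a, in which tied pairs count as neither concordant nor discordant. That choice is an assumption of the sketch, and library implementations may apply a tie correction instead. The function name and data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it is counted in neither group
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Made-up data: three observations with one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

In the example, two of the three pairs are concordant and one is discordant, so τ = (2 - 1) / 3.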

Phik (φk)

Phik (φk) is a practical correlation coefficient that works consistently across categorical, ordinal, and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

[Figures: 4 missing-value plots, rendered with Matplotlib v3.5.0]

Sample

First rows

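Index | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | MEDV
0 | 0.00632 | 18.0 | 2.31 | 0.0 | 0.538 | 6.575 | 65.2 | 4.0900 | 1 | 296 | 15.3 | 396.90 | 4.98 | 24.0
1 | 0.02731 | 0.0 | 7.07 | 0.0 | 0.469 | 6.421 | 78.9 | 4.9671 | 2 | 242 | 17.8 | 396.90 | 9.14 | 21.6
2 | 0.02729 | 0.0 | 7.07 | 0.0 | 0.469 | 7.185 | 61.1 | 4.9671 | 2 | 242 | 17.8 | 392.83 | 4.03 | 34.7
3 | 0.03237 | 0.0 | 2.18 | 0.0 | 0.458 | 6.998 | 45.8 | 6.0622 | 3 | 222 | 18.7 | 394.63 | 2.94 | 33.4
4 | 0.06905 | 0.0 | 2.18 | 0.0 | 0.458 | 7.147 | 54.2 | 6.0622 | 3 | 222 | 18.7 | 396.90 | NaN | 36.2
5 | 0.02985 | 0.0 | 2.18 | 0.0 | 0.458 | 6.430 | 58.7 | 6.0622 | 3 | 222 | 18.7 | 394.12 | 5.21 | 28.7
6 | 0.08829 | 12.5 | 7.87 | NaN | 0.524 | 6.012 | 66.6 | 5.5605 | 5 | 311 | 15.2 | 395.60 | 12.43 | 22.9
7 | 0.14455 | 12.5 | 7.87 | 0.0 | 0.524 | 6.172 | 96.1 | 5.9505 | 5 | 311 | 15.2 | 396.90 | 19.15 | 27.1
8 | 0.21124 | 12.5 | 7.87 | 0.0 | 0.524 | 5.631 | 100.0 | 6.0821 | 5 | 311 | 15.2 | 386.63 | 29.93 | 16.5
9 | 0.17004 | 12.5 | 7.87 | NaN | 0.524 | 6.004 | 85.9 | 6.5921 | 5 | 311 | 15.2 | 386.71 | 17.10 | 18.9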

Last rows

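Index | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | MEDV
496 | 0.28960 | 0.0 | 9.69 | 0.0 | 0.585 | 5.390 | 72.9 | 2.7986 | 6 | 391 | 19.2 | 396.90 | 21.14 | 19.7
497 | 0.26838 | 0.0 | 9.69 | 0.0 | 0.585 | 5.794 | 70.6 | 2.8927 | 6 | 391 | 19.2 | 396.90 | 14.10 | 18.3
498 | 0.23912 | 0.0 | 9.69 | 0.0 | 0.585 | 6.019 | 65.3 | 2.4091 | 6 | 391 | 19.2 | 396.90 | 12.92 | 21.2
499 | 0.17783 | 0.0 | 9.69 | 0.0 | 0.585 | 5.569 | 73.5 | 2.3999 | 6 | 391 | 19.2 | 395.77 | 15.10 | 17.5
500 | 0.22438 | 0.0 | 9.69 | 0.0 | 0.585 | 6.027 | 79.7 | 2.4982 | 6 | 391 | 19.2 | 396.90 | 14.33 | 16.8
501 | 0.06263 | 0.0 | 11.93 | 0.0 | 0.573 | 6.593 | 69.1 | 2.4786 | 1 | 273 | 21.0 | 391.99 | NaN | 22.4
502 | 0.04527 | 0.0 | 11.93 | 0.0 | 0.573 | 6.120 | 76.7 | 2.2875 | 1 | 273 | 21.0 | 396.90 | 9.08 | 20.6
503 | 0.06076 | 0.0 | 11.93 | 0.0 | 0.573 | 6.976 | 91.0 | 2.1675 | 1 | 273 | 21.0 | 396.90 | 5.64 | 23.9
504 | 0.10959 | 0.0 | 11.93 | 0.0 | 0.573 | 6.794 | 89.3 | 2.3889 | 1 | 273 | 21.0 | 393.45 | 6.48 | 22.0
505 | 0.04741 | 0.0 | 11.93 | 0.0 | 0.573 | 6.030 | NaN | 2.5050 | 1 | 273 | 21.0 | 396.90 | 7.88 | 11.9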